16. Text: Measures of Center and Spread Summary
Recap
Variable Types
We have covered a lot up to this point! We started with identifying data types as either
categorical
or
quantitative
. We then learned, we could identify quantitative variables as either
continuous
or
discrete
. We also found we could identify categorical variables as either
ordinal
or
nominal
.
Categorical Variables
When analyzing categorical variables, we commonly just look at the count or percent of a group that falls into each
level
of a category. For example, if we had two
levels
of a dog category:
lab
and
not lab
. We might say, 32% of the dogs were
lab
(percent), or we might say 32 of the 100 dogs I saw were labs (count).
However, the 4 aspects associated with describing quantitative variables are not used to describe categorical variables.
Quantitative Variables
Then we learned there are four main aspects used to describe
quantitative
variables:
- Measures of Center
- Measures of Spread
- Shape of the Distribution
- Outliers
We looked at calculating measures of
Center
- Means
- Medians
- Modes
We also looked at calculating measures of
Spread
- Range
- Interquartile Range
- Standard Deviation
- Variance
Calculating Variance
We saw that we could calculate the
variance
as:
\bold{\frac{1}{n}\sum\limits_{i=1}^n(x_i - \bar{x})^2}
You will also see:
\bold{\frac{1}{n-1}\sum\limits_{i=1}^n(x_i - \bar{x})^2}
The reason for this is beyond the scope of what we have covered thus far, but you can find an explanation here .
You can commonly find answers to your questions with a quick Google search . Now is a great time to get started with this practice! This answer should make more sense at the completion of this lesson.
Standard Deviation vs. Variance
The standard deviation is the square root of the variance. In practice, you usually use the standard deviation rather than the variance. The reason for this is because the standard deviation shares the same units with our original data, while the variance has squared units.
What Next?
In the next sections, we will be looking at the last two aspects of quantitative variables: shape and outliers . What we know about measures of center and measures of spread will assist in your understanding of these final two aspects.